Corpus: por_news_2020_10K

Other corpora

5.1.18 Words nearly always as next neighbors

Strong NN co-occurrences with a low probability of being separated

The quotient below is calculated as freq(word1)*freq(word1)/NN_freq^2.

Word 1 Word 1 Frequency of word 1 Frequency of word 2 Frequency as NN Qoutient
Estados Unidos 62 52 50 1.29
Mato Grosso 20 21 20 1.05
Reino Unido 10 10 10 1.00
cestas básicas 12 10 9 1.48
Belo Horizonte 10 9 9 1.11
Volta Redonda 11 9 9 1.22
Nossa Senhora 12 9 9 1.33
Meio Ambiente 6 7 6 1.17
Produto Interno 5 7 5 1.40
Lava Jato 7 7 7 1.00
Medida Provisória 8 6 6 1.33
Interno Bruto 7 5 5 1.40
Alta Floresta 4 5 4 1.25
Terapia Intensiva 5 5 5 1.00
Hong Kong 5 5 5 1.00
Aldir Blanc 4 4 4 1.00
pavimentação asfáltica 3 4 3 1.33
fake news 5 4 4 1.25
home office 4 4 4 1.00
Helder Barbalho 3 3 3 1.00
66 msec needed at 2021-07-07 05:01